CDS

Accession Number TCMCG058C04404
gbkey CDS
Protein Id KAF7121079.1
Location complement(join(8008689..8010912,8011282..8011361,8015702..8017942))
Organism Rhododendron simsii
locus_tag RHSIM_Rhsim13G0080300

Protein

Length 1514aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA588298, BioSample:SAMN13241185
db_source WJXA01000013.1
Definition hypothetical protein RHSIM_Rhsim13G0080300 [Rhododendron simsii]
Locus_tag RHSIM_Rhsim13G0080300

EGGNOG-MAPPER Annotation

COG_category I
Description oxidosqualene cyclase activity
KEGG_TC -
KEGG_Module -
KEGG_Reaction R03200        [VIEW IN KEGG]
R06466        [VIEW IN KEGG]
R06469        [VIEW IN KEGG]
R07322        [VIEW IN KEGG]
R07323        [VIEW IN KEGG]
KEGG_rclass RC01582        [VIEW IN KEGG]
RC01850        [VIEW IN KEGG]
RC01851        [VIEW IN KEGG]
RC01863        [VIEW IN KEGG]
RC01864        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01853        [VIEW IN KEGG]
ko:K06045        [VIEW IN KEGG]
ko:K15813        [VIEW IN KEGG]
ko:K20659        [VIEW IN KEGG]
EC 4.2.1.129        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
5.4.99.17        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
5.4.99.39        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
5.4.99.41        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
5.4.99.8        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00100        [VIEW IN KEGG]
ko00909        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00100        [VIEW IN KEGG]
map00909        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGATCAACACCTCAGTCACTCTACCAATTCTATGAATGGTTTCAAGTTCAACGATGATGCCTTTTCGCCGGCATTCGATGAATCTGAAAATCTCGCAAACAGATTCAATTTTGAAGATGACACTAGAGATCTTAACTTCATGGAGCTTCCTTATCCTCCTCATGATCCCTACCCTGGCTTTGTGAATTTAATATCCAGTGGAAGCTCTGAAGTGGATTCTTCAGATGATATTGGCCTTTCCGATAATATTTTAAAGTTCATAACCACAATACTCATGGAGGAGAACATGGAGCAGAAGCCTAGTATGTTCCATGATCCTTTAGCCCTACAAGCTACTGAGAAATCCTTATATGATGTCATTGGAGAAAAGTACCCAGTTTCCACAAGTCAACCCCCTCTTTACTTCAATCACATTGTCGAAAGCCCAGATGAGTCTTTCTTTGGTAGTTCCAGTGAGCATAGTAGTAACAGTAATTCTAGTGGGTCTTTCTCTGTTGATTCTCAATGGAATGTTGATCCTGGAGAGTATCAGCAGTCTGTGCAAGGTTATTCTGTTGCATACCCATTTGGGACGTCATTTGGCTCTACGGATAGGTTCAGTCATAATGGTAGTGGTCATAGGACCTCTTATAAGAGTAGCCATTTGGTTCCAGACATATTTAGCGATCGTCAGTCCATCTTGGAATTCAATAAAGGGGTAGAGGAAGCTAGCAAGTTCCTTCCTCAAAATAGTCAATTGCTAATTGATTTGGAGTGCTATGACCTGCCTCCAGAATCTAATGGAGGGGCTCCGGAGGTTGTAGTCAACGTGGAGGATACAAGGGAATCCTCGCCCAATGGATCACGGGGTAAGAAGTATCATCACCGCCAGGATAGTTGTAATGTAGAAGAAGAGAGGAATAGCAAGTTTTCAGCAGTTTATGACGACGAGTCCGAGTTGTCTGAGATGTTTGATAAGGTTTTGCTAATCGATCCAAAAGTGGAGGGTGCACACTGTCATAGTGATGAAAAAGTACTGAGTGAAGCTTCGAAGGCCACCCAAGTGACTGGACAACCCCAGGGATCTAAAGGCGGAAAGATTCGTGGTAAAAAACGGGGAAACAAAACCGAAGTGGTCGATTTGAGAACACTCTTGTCCAATTGTGCACAATCTGTTGCTGCTAATGATCGCAGGACTGCTAATGAACAACTAAAGCAGATTAGGCAGCACTCCTCTCCTTCCGGTGATGGGTTTCAGAGGTTGGCCAATGTTTTTGCCAATGGCCTCGAGGCCCGTTTGGATGGGACCGGAACCCAAATCTATGCAGCCCTTGCTTCCAAGAGAATTTCAGCTGCTGAAAAGTTAAAAGCTTATCAACTTTTTCTTTCATGTTGTCCTTACAAGAAGATGTCGATATTCTTTGCAAACAAAACGATTGTAGATTTAGCTATGAATTCTAGCAAAAAGAAGCTTCACATTATAGACTTTGGAATCGTATATGGTTTCCAGTGGCCCATGGTTATCCAATATCTGTCAACGATTCCTGGTGGGCCCCCTGAACTTCGAATTACGGGGATAGAGCTTCCACAACCCGGTTTTCGTCCAGCAGAGCTTGTTGAGGAGACAGGGCGTCGCTTGGCAAAGTATTGTGAACGCTTTAATGTTCCATTCAAATATCATGCCATAGCACAGAAGTGGGAAACTATCAAAATTGAGGACCTCAACCTGCACGAGGACGAGGTGCTTGCTGTGAACTGCCTCTTCCAGTTTAAGAACCTACTTGATGAGACAGTAACGGATGATAGCCCAAGAGATGCTGTTTTGAAGTTAATTAGGAAGATGAATCCAGACAGTTTCATCCAATCAGTCATTAATGGATCTTTCAATTCCCCTTTCTTTGTCACTCGGTTTCGTGAGGCTCTCTTCCACTACTCTTCACTGTTTGATATGTTTGACACTAATATACCCCGTGACAATCAAGAGAGGATGGATTTCGAGCAAGAGTTCTGTGGGCGTGAGATTACGAATGTAATCGCGTGTGAGGGGGTGGAAAGGGTAGAGAAGCCTGAGACATACAAGCAGTGGCAAGTTCGCACTATGAGGGCTGGTTTGAAGATGCTCCCATTGTCACAGGAGAATTTGAAGAAGTTCAGGCGTAAGGTGAAGGCACATTATCATAATGATTTTGTGATCGAGGAAGATGGTCAATGGATGCTGCAAGGATGGAAGGGTCGGATTTTCTGTGCTAGCTCCTGCTGGACTTTGTCGTTGTCTTCCACTTCCATTACTATATTGCTCCTCTCGAAAAAGGGTCACTGCGCTCTTGCTCTCTCTCCAAGGGCTGTTGAGTTGCATAAGATCATGGATCCACATTTCAGTGCATTCTCCTATCCGTCAAACGGTTTCGAATTCGATGATGGTGCCTTCTCACCCACTTTCCGTCCGTCCGAAAATGTCGCCAACACATTCCCATTTCGAGATGACCCTGGAGATCTTAGCATAATGGACAAATTTCTTCCCCCTGTTCCCAACCCCGGTTTTGCGGCTTTGACATTGGGTGGAAGCTCAGAAGTGGACTCTCCAGATGATAGTGACTTCTCTGATGCTGTTCTCAAGTTCATAAACACAACGCTCATGGAAGAGAACATGGAGCAGACACCCTCTACGTTCCACGATCCTTTGGCCCTGCGAGCTACTGAGAAATCGTTGTATGATGTTATTGGTGAGAATTATCCTGCTTCCCTGAATCAACCCCCTCTTTACTTCAATCAAAATTCTGAAAGCCCAGATGAGTACCTATTTAGTACTTCCAGCGAGCAGAGTATTAATAGTAATACTAACTGTGGCAACTCTGTTGATACTCAACAGACTGTTGATCCTGGTGCTTTCAGATACTGTCTTAAGACAACTATGGGGAGTTCTACTGAGAAGAGTACTGGTACAATGAACTCCTCGATGAGTAGCCATCCGGTCCCAAAGATGTTTATAGATAGCCAGTCGATATGGCAGTTTAAGAGAGGGGTGGAGGAAGCTAGCAAGTTCCTTCCTAGTAATAATGGATTGGTTATTGATTTGGAGAGCCATATGATGCCTCCAGGGTCAATGAAAGAGGCTCTGGATGTTGCAGCCGAGTCGGGGAAGGACGAAAGGGGAAACTCACCTAATGGCTCAAGGGGTAGGAAGAATCATCACCATGAGGATAGTGAATTAGAAGAGGTGAGGACTACCAAGTTTTCTGCAATTTACGTGGAAGAGTCCGAGTTATCTGACATGTTTGATAAGGTTTTGTTGTGTGATGTAAAAGCGGAGCCTACATGCTGTGATGCTGATGAAGAAGTGGAGAGTGAACAAACCAAGACGAATGGCCGACCACATGGATCTAAGGGCAGGAAGAGTCGCGCTAAGAAACAGAAAAGTAAAAACGAAGTGGTGGATTTGGTGACGCTCTTGATGAATTGCGCACAATCTGTTGCTGCAGAGGATCGCAGGAGTGCATATGAACAACTAAAGAACATTAGGCAGCACTCTTCTCCTTCAGGTGATGGGTTTCAAAGGTTGGCAAATGTCTTCGCCAATGCCCTTGAGGCACGCTTGGCTGGCACTGGAAGCCAGCTCTACGCAGCCCTAGCTTCCAAGAGGATTTCAGCAACTGAAAAGTTGAAAGCTTACCAACTTTATTTTTCAGCTAGCCCATTCAAGAAGATATGCATGTTATTTGCGAACCAAATGATTGAAGATTTAGCTATAAATTCTAGCAGAAAGAAGCTCCACGTTGTTGATTTTGGTATCCAATATGGTTTTCAGTGGCCCATGCTTATCCAACTTCTCTCAGGGCTTCCTGGCGGGGCCCCCGAACTGCGAATTACAGGGATTGAGCTTCCACAACCTGGTTTTCGCCCCACAGAGTTTGTTGAGGAGACAGGGCGTCGCCTGGCAAAGTATTGTGAACGCTTTAATGTTCGATTCCAATACCATGCCATAGCTCAAAAATGGGAAACTATCAAAATTGAGGAACTCAACCTATACAATGATGAGGCGCTTGCTGTGAATTGTCACTTCACGTTAAAGAACCGACTTCGTGATTCAGATGTGGAGGGTAGTCCAAGAGATGCTCTTTTGATGTTAATTAGGAAGATGAATCCGGACATTTTTGTCCATACTACTAACAATGGATCATACAGTCCCTTTTTTCTCAATCGGTTTCGTGAGGCTCTCTTTCACTACTCTTCACTGTTTGATATGTTTGACGCTACTCTACCCCGCGAGGATCAGGACAGGAGGAACTTTGAGCAAGAGTTCTACGGGCGCGAGGTTATGAATGTAATTGCATGTGAGGGATTGGAAAGGGTAGAGAGGCCTGAGCCTTACAAGCAGTGGCAGGTTCGCAATATGAATGCTGGTTTCAGGATCCTCCCATTGAAGCAGGATCTCGTGAAGAATCTCAGGGTTAAGGTGAAGGCAGGTTATCATAAAGATTTCGTGATCGAGGAAGATGGTCAATGGATGCTACAAGGATGGAAGGGCCGGATTTTCTGTGCTAGCTCCTGTTGGGTACCTGGATAG
Protein:  
MDQHLSHSTNSMNGFKFNDDAFSPAFDESENLANRFNFEDDTRDLNFMELPYPPHDPYPGFVNLISSGSSEVDSSDDIGLSDNILKFITTILMEENMEQKPSMFHDPLALQATEKSLYDVIGEKYPVSTSQPPLYFNHIVESPDESFFGSSSEHSSNSNSSGSFSVDSQWNVDPGEYQQSVQGYSVAYPFGTSFGSTDRFSHNGSGHRTSYKSSHLVPDIFSDRQSILEFNKGVEEASKFLPQNSQLLIDLECYDLPPESNGGAPEVVVNVEDTRESSPNGSRGKKYHHRQDSCNVEEERNSKFSAVYDDESELSEMFDKVLLIDPKVEGAHCHSDEKVLSEASKATQVTGQPQGSKGGKIRGKKRGNKTEVVDLRTLLSNCAQSVAANDRRTANEQLKQIRQHSSPSGDGFQRLANVFANGLEARLDGTGTQIYAALASKRISAAEKLKAYQLFLSCCPYKKMSIFFANKTIVDLAMNSSKKKLHIIDFGIVYGFQWPMVIQYLSTIPGGPPELRITGIELPQPGFRPAELVEETGRRLAKYCERFNVPFKYHAIAQKWETIKIEDLNLHEDEVLAVNCLFQFKNLLDETVTDDSPRDAVLKLIRKMNPDSFIQSVINGSFNSPFFVTRFREALFHYSSLFDMFDTNIPRDNQERMDFEQEFCGREITNVIACEGVERVEKPETYKQWQVRTMRAGLKMLPLSQENLKKFRRKVKAHYHNDFVIEEDGQWMLQGWKGRIFCASSCWTLSLSSTSITILLLSKKGHCALALSPRAVELHKIMDPHFSAFSYPSNGFEFDDGAFSPTFRPSENVANTFPFRDDPGDLSIMDKFLPPVPNPGFAALTLGGSSEVDSPDDSDFSDAVLKFINTTLMEENMEQTPSTFHDPLALRATEKSLYDVIGENYPASLNQPPLYFNQNSESPDEYLFSTSSEQSINSNTNCGNSVDTQQTVDPGAFRYCLKTTMGSSTEKSTGTMNSSMSSHPVPKMFIDSQSIWQFKRGVEEASKFLPSNNGLVIDLESHMMPPGSMKEALDVAAESGKDERGNSPNGSRGRKNHHHEDSELEEVRTTKFSAIYVEESELSDMFDKVLLCDVKAEPTCCDADEEVESEQTKTNGRPHGSKGRKSRAKKQKSKNEVVDLVTLLMNCAQSVAAEDRRSAYEQLKNIRQHSSPSGDGFQRLANVFANALEARLAGTGSQLYAALASKRISATEKLKAYQLYFSASPFKKICMLFANQMIEDLAINSSRKKLHVVDFGIQYGFQWPMLIQLLSGLPGGAPELRITGIELPQPGFRPTEFVEETGRRLAKYCERFNVRFQYHAIAQKWETIKIEELNLYNDEALAVNCHFTLKNRLRDSDVEGSPRDALLMLIRKMNPDIFVHTTNNGSYSPFFLNRFREALFHYSSLFDMFDATLPREDQDRRNFEQEFYGREVMNVIACEGLERVERPEPYKQWQVRNMNAGFRILPLKQDLVKNLRVKVKAGYHKDFVIEEDGQWMLQGWKGRIFCASSCWVPG